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(ii) TITLE OF INVENTION: A VACC INE- I NDUCE D HEPATITIS B VIRAL 
STRAIN AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Ladas & Parry 

( B ) STREET: 26 West 61 Street 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10023 

(v) COMPUTER PEAE>ABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : 09/719, 5 33 

(B) FILING DATE : 13-DEC-2000 

(C) CLASSIFICATION: 435 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT / SG9 8 / 0 0 0 4 5 

(B) FILING DATE: 19-JUN-1998 

( viii ) ATTORNEY/AGENT INFORMATION : 

■A) NAME: Miller, Cynthia R. 

-■E) REGISTRATION NUMBER : 34, 678 

f C ) P S FERENCF / DOCKET NUMBER :U- 013108-9 

( i x ) T E LECOMMUN I CAT I ON I N FORMAT I ON : 
?A) TELE F HONE : (212) 708-1890 

(2) INFORMATION FOF. SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3215 base pairs 

(B) TYPE: nucleic acid 

(C) 3TF.ANDEDNESS: double 




(D) TOPOLOGY: circular 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CTCCACCACT TTCCACCAAA CTCTTCAAGA TCCCAGAGTC AGGGCCCTGT ACTTTCCTGC 
60 



TGGTGGCTCC AGTTCAGGAA CAGTGAGCCC TGCTCAGAAT ACTGTCTCTG CCATATCGTC 1 
2 0 

AATCTTATCG AAGACTGGGG ACCCTGTACC GAACATGGAG AACATCGCAT CAGGACTCCT 1 

8 0 

AGGACCCCTG CTCGTGTTAC AGGCGGGGTT TTTCTTGTTG ACAAAAATCC TCACAATACC 2 
40 

GCAGAGTCTA GACTCGTGGT GGACTTCTCT CAATTTTCTA GGGGGAACAC CCGTGTGTCT 3 
00 

TGGCCAAAAT TCGCAGTCCC AAATCTCCAG TCACTCACCA ACCTGTTGTC CTCCAATTTG 3 
60 

TCCTGGTTAT CGCTGGATGT GTCTGCGGCG TTTTATCATC TTCCTCTGCA TCCT GCTGCT 4 
2 0 

ATGCCTCATC TTCTTGTTGG TTCTTCTGGA CTATCAAGGT ATGT TGCCCG TTTGTCCTCT 4 
8 0 

AATTCCAGGA TCAACAACAA CCAGCACCGG ACCATGCAAA ACCTGCACAA CT ZCTGCTCA 5 
4 0 

AGGAACCTCT ATGTTTCCCT CATGTTGCTG TACAAAACCT ACGGACAGAA ACTGCACCTG 6 
0 0 

TATTCCCATC CCATCATCTT GGGCTTTCGC AAAATACCTA TGGGAGTGGG CCTCAGTCCG 6 
60 

TTTCTCTTGG CTCAGTTTAC TAGTGCCATT TGTTCAGTGG TTCGTAGGGC TTTCCCCCAC 7 

2 0 

TGTCTGGCTT TCAGTTATAT GGATGATGTG GTTTTGGGGG CCAAGTCTGT A:AACATCTT 7 

3 0 

GAGTOCCTTT ATGCCGCTGT TACCAATTTT CTTTTGTCTT TGGGTATA2A TTTAAACCCT 3 

4 0 

CACAAAACAA AAAGATGGGG ATATTCCCTT AACTTCATGG GAT ATGTCAI TGGGAGTTGG 3 
0 0 

GGCACATTGC CACAGGAACA TATTGTACAA AAAATCAAAA TGTGTTTTAG GAAACTTCCT 9 
60 



GTAAACAGGC CTATTGATTG GAAAGTATGT CAACGAATTG TGGGTCTTTT GGGGTTTGCC 10 
20 

GCCCCTTTCA CGCAAT 3TGG ATATCGTGCT TTAATGCCTT TATATGCATG TATACAAGCA 10 
SO 

AAACAGGC TT TTACTTTCTC GCAAACTTAC AAGACCTTTC TAAGTAAACA GTATCTGAA^ 11 
4 0 

CTTTACCCCG TTGCTC 3GCA ACGCCCTGGT CTGTGCCAAG TGTTTGCTGA CGCAACCCCC 12 
0 0 

ACTGGTTGGG GCTTGGCCAT AGGCCATCAG CGCATGCGTG GAACCTTTGT GTCTCCTCTG 12 
6 0 

CC GAT CO AT A CTGCGGAACT CCTAGCCGCT TGTTTTGCTC GCAGCAGGTC TGGGGCAAAA 13 

2 0 

CTCATCGGGA CTGACAATTC TGTCGTGCTC TCCCGCAAGT ATACATCATT TCCATGGCTG 1 
8 0 

CTAGGCTGTG CTGCCAACTG GATCCTGCGC GGGACGTCCT TTGTTTAOGT CCCGT^GG^G 14 
4 0 

CTGAATCCCG CGGACGACCC CTCCCGGGGC CGCTTGGGGC TCTACCGCCC GCTTCTCCGC 15 
0 0 

CTGTTATACG GACCGACCAC GGGGCGCACG TCTCTTTACG CGGACTCCCC GTCTGTGCCT 15 

TCTCATCTGC CGGACCGTGT GCACTTCGCT TCACCTCTGC ACGTCGCATG GAGACCACCG 16 
2 0 

TGAACGCCCA CGGGAAGCTG CCCAAGGTCT TGCATAAGAG GACTCTTGGA CTTTCAGCAA 16 

8 0 

TGTCAACGAC CGACCTTGAG GCATACTTCA AAGACTGTGT GTTTAATGAG TGGGAGGAGT 17 
4 0 

TGGGGGAGGA GGTTAGGTTA AAGGTCTTTG TACTAGGAGG GTGTAGGCAT AAATTGGTGT 18 

GTTCAC 2ATO ACCATG ZAAC TTTTTCACCT CTGCCTAATC ATGTCATGTT CATGTCGTAC 18 
6 0 

TGTTCAAGCC TCCAAGGTGT GCCTTGGGTG GCTTTGGGGC ATGGACATTG ACGCGTATAA 19 

AGAATTTGGA GCTTCTGTGG AG T TACT CTC TTTTTTGCCT TCTGACTTTT TTCCTTCTAT 1 : < 

8 0 

TCGAGATCTC CTCGACACCG CCTCTGCTCT GTATCGGGAG GCCTTAGAGT CTCCGGAACA 2 0 



# 



TTGTTCACCT CACCATACGG 

0 0 

TCTAGCCACC TGGGTGGGAA 
60 

CTATGTCAAC GTTAATATGG 

20 

TCTTACTTTT GGGAGAGAAA 

3 0 

CACTCCTCCT GCATATAGAC 

4 0 

TGTTGTTAGA CGAAGAGGCA 
00 

GTCTCAATCG CCGCGTCGCA 
60 

GGACACATAA GGTGGGAAAC 

;:o 

CTAAATGGCA AACTCCTTCT 
3 0 

GTAAGCAATT TGTGGGGCCC 
A 0 

CTGCTAGGTT TTATCCAAAT 
0 0 

ATTATCCAGA GTATGTAGTT 
60 

GGAAGGCGGG GATCTTATAT 
2 0 

GATATTCTTG GGAACAA 3AT 
0 

ATGGGGACAA ATCTTTCTGT 



OCTGCATTCA AAGCCAACTC 
0 0 

TGGCCGGACG CCAACAAGGT 
60 



CACTCAGGCA AGCTATTCTG AGTTGGGGTG AGTTAATGAA 21 

GTAATTTGGA AGATCCAGCA TCCAGGGAAT TAGTAGTCAG 21 

GCCTAAAAAT CAGACAACTA TTGTGGTTTC ACATTTCCTG 2 2 

CTGTTCTTGA ATATTTGGTG TCTTTTGGAG TGTGGATTCG 22 

CACCAAATGC CCCTATCTTA TCAACACTTC CGGAAACTAC 2 3 

GGTCCCCTAG AAGAAGAACT CCCTCGCCTC GCAGACGAAG 2 4 

GAAGATCTCA ATCTCGGGAA TCTCAATGTT AGTATTCCTT 2 4 

TTTACGGGGC TTTATTCTTC TACGGTACCT TGCTTTAATC 2 5 

TTTCCGGACA TTCATTTGCA GGAGGACATT CTTGATAGAT 2 5 

CTTACAGTAA ATGAAAACAG GAGACTAAAA TTAATTATGC 2 6 

GTTACTAAAT ATTTGCCCTT AGATAAAGGG ATCAAACCAT 2 7 

AAT CAT TACT TCCAGACGCG ACATTATT TA CACACTCTTT 2 7 

AAAAGAGAGT CCACACGTAG CGCCTCATTT TGCGGGTCAC 2 3 

CTACAGCAT G GGAGGTTGGT CTTCCAAACC TCGAAAAGGC 23 

CCCCAATCCC CTGGGAT TCT TCCCCGATCA TCAGTTGGAC 2 9 

AGAAAATCCA GATTGGGACC TCAACCCGCA CAAGGACAAC 3 0 

GGGAGTGGGA GCATTCGGGC CAGGGTTCAC CCCTCCTCAT 3 0 



GGGGGACTGT TGGGGTGGAG CCCTCAGGCT CAGGGCCTAC TCACAACTGT GCCAGCAG^T 31 
20 

CCTCCTCCTG CCTCCACCAA TCGGCAGTCA GGAAGGCAGC CTACTCCCTT ATCTCCAPTT 31 
8 0 

CTAAGGGACA CTCATCCTCA GGCCATGCAG TGGAA 32 
15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Pro Leu 3er Tyr Gin His Phe Arg Lys Leu Leu Leu Leu Asp Glu 
15 10 15 

Glu Ala Gly Pro Leu Glu Glu Glu Leu Pro Arg Leu Ala Asp Glu Gly 
20 25 30 

Leu Asn Arg Arg Val Ala Glu Asp Leu Asn Leu Gly Asn Leu Asn Val 

3 5 4 0 4 5 

Ser lie Pro Trp Thr His Lys Val Gly Asn Phe Thr Gly Leu Tyr Ser 

50 55 60 

Ser Thr Val Pro Cys Phe Asn Pro Lys Trp Gin Thr Pro Ser Phe Pro 
65 7 0 7 5 3 0 

Asp lie His Leu Gin Glu Asp lie Leu Asp Arg Cys Lys Gin Phe Val 

85 90 95 

Glu Pro. Leu Thr Val Asn Glu Asn Arg Arg Leu Lys Leu lie Met Pro 
100 105 110 

Ala Arg Phe Tyr Pro Asn Val Thr Lys Tyr Leu Pro Leu Asp Lys Gly 
115 120 125 

lie Lys Pro Tyr Tyr Pro Glu Tyr Val Val Asn His Tyr Phe Gin Thr 

130 135 140 



Arg His Tyr Leu His Thr Leu 
14 5 15 0 



Trp Lys Ala Gly lie Leu Tyr Lys Arg 
155 160 



Glu Ser Thr Arg Ser Ala Ser Phe Cys Gly Ser Pro Tyr Ser Trp Glu 

165 170 175 

Gin Asp Leu Gin His Gly Arg Leu Val Phe Gin Thr Ser Lvs Arg His 
180 185 190 

Gly Asp Lys Ser Phe Cys Pro Glu Ser Pro Gly lie Leu Pro Arg Ser 
195 200 205 

Ser Val Gly Pro Cys lie Gin Ser Gin Leu Arg Lys Ser Arg Leu Gly 
210 215 22 0 

Pro Gin Pro Ala Gin Gly Gin Leu Ala Gly Arg Gin Gin Gly Gly Ser 
225 230 235 240 

Gly Ser Tie Arg Ala Arg Val His Pro Ser Ser Trp Gly Thr Val Gly 

245 250 255 

Val Glu Pro Ser Gly Ser Gly Pro Thr His Asn Cys Ala Ser Ser Ser 
260 265 270 

Ser Ser Cys Leu His Gin Ser Ala Val Arg Lys Ala Ala Tyr Ser Leu 
275 280 28 5 

lie Ser Thr Ser Lys Gly His Ser Ser Ser Gly His Ala Val Glu Leu 
2 9 0 2 9 5 3 0 0 

His His Phe Pro Pro Asn Ser Ser Arg Ser Gin Ser Gin Gly Pro Val 
305 310 315 320 

Leu Ser Cys Trp Trp Leu Gin Phe Arg Asn Ser Glu Pro Cys Ser Glu 

325 330 335 

Tyr Cys Leu Cys His lie Val Asn Leu lie Glu Asp Trp Gly Pro Cys 
340 345 350 

Thr Glu His Gly Glu His Arg lie Arg Thr Pro Arg Thr Pro Ala Arg 
355 360 " 365 

Val Thr Gly Gly Val Phe Leu Val Asp Lys Asn Pro His Asn Thr A La 
370 375 380 

Glu Ser Arg Leu Val Val Asp Phe Ser Gin Phe Ser Arg Gly Asn Thr 
385 390 395 400 

Arg Val Ser Trp Pro Lys Phe Ala Val Pro Asn Leu Gin Ser Leu Thr 

405 410 415 



Asn Leu Leu Ser Ser Asn Leu Ser Trp Leu Ser Leu Asp Val Ser Ala 
420 425 430 



Ala Phe Tyr His Leu Pro Leu His Pro Ala Ala Met Pro His Leu Leu 

435 440 445 

Val Gly 3er 3er Gly Leu Ser Arg Tyr Val Ala Arg Leu Ser Ser Asn 

4 50 4 55 4 60 

Ser Ar:: lie Asn Asn Asn Glu His Arg Thr Met Glu Asn Leu His Asn 

465 470 475 480 



ber ^ys 



Ser Arg Asn Leu Tyr Val Ser Leu Met Leu Leu Tyr Lys Thi 
485 490 495 



Tyr Gly Gin Lys Leu His Leu Tyr Ser His Pro lie lie Leu Gly Phe 
500 505 510 

Arg Lys lie Pro Met Gly Val Gly Leu Ser Pro Phe Leu Leu Ala Gin 
515 52 0 52 5 

Phe Thr Ser Ala lie Cys Ser Val Val Arg Arg Ala Phe Pro His Cys 
530 535 54Q 

Leu Ala Phe Ser Tyr Met Asp Asp Val Val Leu Gly Ala Lys Ser Val 
545 550 555 560 

Gin His Leu Glu Ser Leu Tyr Ala Ala Val Thr Asn Phe Leu Leu Ser 

56 5 57 0 57 5 

Leu Gly lie His Leu Asn Pro His Lys Thr Lys Arcr Trp Gly Tyr Ser 
530 585 ~ 590 

Leu Asn Phe Met. Gly Tyr Val lie Gly 

59 5 60 0 60 5 



Glu His lie Val Gin Lys He Lys Met Gys Phe Arg Lys Leu Pro Val 

610 615 62 0 

Asn Arg Pro lie Asp Trp Lys Val Cys Gin Arg lie Val Gly Leu Leu 

^25 630 635 640 

Gly Phe Ala Ala Pro Phe Thr Gin Cys Gly Tyr Pro Ala Leu Met Pro 

64 5 650 655 

Leu Tyr Ala Cys lie Gin Ala Lys Gin Ala Phe Thr Phe Ser Gin Thr 

6 6 0 6 6 5 67 0 

Tyr Lys Thr Phe Leu Ser Lys Gin Tyr Leu Asn Leu Tyr Pro Val Ala 

67 5 68 0 68 5 

Arg Gin Arg Pro Gly Leu Cys Glu Val Phe Ala Asp Ala Thr Pro Thr 

69 0 695 70 0 



Gly Trp Gly Leu Ala lie Gly His Gin Arg Met Arg Gly Thr Phe Val 

705 710 715 720 

Ser Pro Leu Pro lie His Thr Ala Glu Leu Leu Ala Ala Cys Phe Ala 

725 730 735 

Arg Ser Arg Ser Gly Ala Lys Leu lie Gly Thr Asp Asn Ser Val Val 
74 0 74 5 7 50 

Leu Ser Arg Lys Tyr Thr Ser Phe Pro Trp Leu Leu Gly Cys Ala Ala 
75 5 7 60 7 65 

Asn Trp lie Leu Arg Gly Thr Ser Phe Val Tyr Val Pro Ser Ala Leu 

7 7 0 7 7 5 7 8 0 

Asn Pre: Ala Asp Asp Pro Ser Arg Gly Arg Leu Gly Leu Tyr Arg Pro 
735 790 795 800 

Leu Leu Arg Leu Leu Tyr Arg Pro Thr Thr Gly Arg Thr Ser Leu Tyr 

805 810 " 815 

Ala Asp Ser Pro Ser Val Pro Ser His Leu Pro Asp Arg Val His Phe 

820 825 8 30 

Ala Ser Pro Leu His Val Ala Trp Arg Pro Pro 
3 3 5 8 4 0^ 



(2) INFORMATION FOR SEQ I D NO : 3 : 

i i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 00 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 

1 5 - 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
2 0 2 5 3 0 

Ala Phe Lys Ala Asn Ser Glu Asn Pro Asp Trp Asp Leu Asn Pro His 
3 5 4 0 4 5 



Lys Asp Asn Trp Pro Asp Ala Asn Lys Val Gly Val Gly Ala Phe Gly 



50 55 60 

Pro Gly Phe Thr Pro Pro His Gly Gly Leu Leu Gly Trp Ser Pro Gin 

65 70 75 30 

Ala Gin Gly Leu Leu Thr Thr Val Pro Ala Ala Pro Pro Pre, Ala Ser 

85 90 95 

Thr Asn Arg Gin Ser Gly Arg Gin Pro Thr Pro Leu Ser Pro Pro Leu 
100 105 110 

Arg Asp Thr His Pro Gin Ala Met Gin Trp Asn Ser Thr Thr Phe His 
115 120 12 5 

Gin Thr Leu Gin Asp Pro Arg Val Arg Ala Leu Tyr Phe Pro Ala Gly 
130 135 140 

Gly Ser Ser Ser Gly Thr Val Ser Pro Ala Gin Asn Thr Val Ser Ala 
145 150 155 160 

lie Ser Ser lie Leu Ser Lys Thr Gly Asp Pro Val Pro Asn Met Glu 

165 170 175 

Asn lie Ala Ser Gly Leu Leu Gly Pro Leu Leu Val Leu Gin Ala Gly 
18 0 18 5 19 0 

Phe Phe Leu Leu Thr Lys lie Leu Thr lie Pro Gin Ser Leu Asp Ser 
195 200 205 

Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Pro Thr Val Cys Leu Gly 
2 1 0 215 2 2 0 

Gin Asn Ser Gin Ser Gin Tie Ser Ser His Ser Pro Thr Cys- Cys Pro 
22 5 230 23 5 "24 0 

Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arq Ara Phe lie lie 

2 4 5 2 5 0 ~ 2 5 5 

Phe Leu Cys lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu 

2 60 2 65 27 0 

Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu lie Pro Gly Ser Thr 

2 7 5 2 8 0 2 8 5 

Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr Thr Pro Ala Gin Gly 

2 9 0 2 9 5 3 0 0 

Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pre- Thr Asp Arg Asn 
305 310 315 320 



Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Lys Tyr Leu 

325 330 335 



Trp Glu Trp Ala 

340 

Phe Val Gin Trp 
355 

lie Trp Met Met 

37 0 

Pro Phe Met Pro 
38 5 



Ser Val Arg Phe 



Phe Val Gly Leu 
360 

Trp Phe Trp Gly 
375 

Leu Leu Pro lie 
390 



Ser Trp Leu Ser 
345 

Ser Pro Thr Val 



Pro Ser Leu Tyr 

38 0 

Phe Phe Cys Leu 
3 95 



Leu Leu Val Pro 

350 

Trp Leu Ser Val 

3 b 5 

Asn lie Leu Ser 



Trp Val Tyr lie 
4 0 0 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser Cys Pro Thr 
1 5 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met: Asp lie 
2 0 2 5 " :-.c 

Asp Pro Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu Ser Phe Leu 
3 5 4 0 4 5 

Pro Ser Asp Phe Phe Pro Ser lie Arg Asp Leu Leu Asp Thr Ala Ser 

50 5 5 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 

65 70 75 8 0 

His Thr Ala Leu Arg Gin Ala lie Leu Ser Trp Gly Glu Leu Met Asn 

8 5 90 ' 9 5 

Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pre. Ala Ser Arg Glu 
100 105 110 

Leu Val Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys lie Arg Gin 
115 120 12 5 

Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 

13 0 135 14 0 



Leu Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala 

145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr 

165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 

13 0 18 5 " 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arq Arg Ser Gin Ser Arg 

195 2 00 " 20 5 

Glu Ser Gl:\ Cys 
2 10 



(2) IN FORMAT 1 01 J FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 
(E) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Ala Ala Arg Leu Cys Cys Gin Leu Asp Pro Ala Ara Asp Val Leu 

1 1 10 ^ 15 

Cys Leu Arg P r Val Gly Ala Glu Ser Arg Gly Arg Pro Leu Pro Gly 
2 0 2 5 3 0 

Pro Leu Gly Ala Leu Pro Pro Ala Ser Pro Pro Val lie Pro Thr Asp 
3 5 4 0 4 5 

His Gly Ala His Leu Ser Leu Arg Gly Leu Pro Val Cys Ala Phe Ser 
50 55 60 

Ser Ala Gly Pre- Cys Ala Leu Arg Phe Thr Ser Ala Arg Arg Met Glu 

65 70 75 " 80 

Thr Thr Val Asn Ala His Gly Asn Leu Pro Lys Va L Leu His Lys Arg 

^ 90 95 

Thr Leu Gly Leu Ser Ala Met Ser Thr Thr Asp Leu Glu Ala Tyr Phe 
10 0 10 5 110 

Lys Asp Cys Val Phe Asn Glu Trp Glu Glu Leu Gly Glu Glu Val Arg 

115 120 12 5 

Leu Lys Val Phe Val Leu Gly Gly Cys Arg His Lys Leu Val Cys Ser 





130 



135 



140 



Pre- Ser Pro Cys Asn Phe Phe Thr Ser Ala 



(2) INFORMATION FOP SEQ ID NO : 6 : 

;i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
(E) TYPE: nucleic acid 
(C) STPANDEDNESS : single 
(P) TOFOLOGY: linear 

(xi) SEQUENCE PES CP I PT I ON : 3EQ ID NO : 6 : 

ATAAGCTTAT GCCCCTATCT TATCAACACT TCCGGA 



(2) INFORMATION FOR SEQ ID NO : 7 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STPANDEDNESS : single 
iu) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

GAGTCTAGAC TCTGCGGTAT TGTGA 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
(E) TYPE: nucleic acid 
(C) STPANDEDNESS : single 
( 2) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

GAGTCTAGAC TCGTGGTGGA CTTCT 



(2) INFORMATION FOR SEQ ID NO : 9 : 

; i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 32 base pairs 



14 5 



150 



• 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

TGAGAATTCT CACGGTGGTC TCCATGCGAC GT 
32 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCF IPTION : SEQ ID NO: 10 

TTTGTTTACG TCCCGT 
16 



(2) IN FOP MAT ION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(E) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
( I' ) TOPOLOGY. Imea-L 

(xi) SEQUENCE DESCRI PTION : SEQ ID NO : 1 1 

CTAAGCTTAG TTTCCGGAAG TGTTGAT 

27 



